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Abstract 

We present an analysis of Dutch cross serial depen- 
dencies in Head-driven Phrase Structure Grammar 
([P&S(1994)]). We start out from the assumption that 
causative and perceptual verbs, like auxiliaries, can 
lexically 'raise' the arguments of the non-finite verbs 
they govern to their own hst of arguments through 
"argument composition" ([H&N(1989)]). 

1 Introduction 

Dutch cross serial dependencies (DCSDs), well-known 
from (1) and (2), still challenge computational linguis- 
tics for an efficient treatment. 

(1) dat iki haara de nijlpaardeua zagi voereua 
that I her the hippos saw feed 

"that I saw her feed the hippos" 

(2) dat iki Henka haarg de nijlpaardeUg zagi 
that I Hcnk her the hippos saw 
helpeua voereug 

help feed 

"that I saw Henk help her feed the hippos" 

The problematic aspects of DCSDs are of course 
the bounded discontinuous relation between the NPs 
and the verbs of which they are arguments, indi- 
cated in (1) and (2) by the subscripted integers, 
and the recursiveness of the phenomenon. The con- 
struction is only licensed by members of two closed 
classes of verbs, the class of perceptual verbs like 
2«en("see"), /ioren( "hear") and woeZen( "feel"), and the 
class of causative verbs like /aten( "let /make") and 
/telpen("help"). In the analysis put forward here we 
emphasize this lexical aspect of the phenomenon; in 
our analysis DCSDs are strictly tied to the subcate- 
gorization and semantics of perceptual and causative 
verbs. We analyze them as verbs which select, apart 
from their subject, a nonfinite V-projection which de- 
notes an event. More particularly, as is proposed for 
German auxiliaries in [H&N(1989)], they subcatego- 
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rize for the arguments of the verb they govern, a mech- 
anism frequently referred to as argument composition 
or argument inheritance. 

Recently DCSDs have been analyzed in a non- 
standard version of HPSG^ in [Reape(fc.)]. In his so- 
called sequence union approach, the standard concept 
of phrase structure grammar (i.e. that a string is de- 
fined as the terminal yield of a phrase structure tree) 
is abandoned. Our analysis is more standard, in the 
sense that we only need to refer to the lexicon and 
the HPSG-mcchanism of structure sharing.'^ Our pre- 
ferred explanatory mechanism, argument composition, 
is not so much an additional mechanism as an ejfect 
which derives from careful specification of structure 
sharing, and structure sharing is already at the theo- 
retical core of HPSG. 

Furthermore, argument composition is indepen- 
dently motivated, because Dutch is like German with 
respect to the phrase-structural behaviour of auxil- 
iaries, and argument composition in German construc- 
tions with auxiliaries is well-motivated ([H&N(1989)]). 
So we have good reason to assume argument composi- 
tion present in the theory, regardless of DCSDs. 

2 Event Semantics in HPSG 

The choice of semantics in terms of a theory of events, 
known from [Davidson(1967)], offers interesting ad- 
vantages and explanations of logical and linguistic phe- 
nomena, motivating the development of a constraint- 
based version of it.^ So, in the spirit of event seman- 
tics we propose that main verbs like voeren{"ieed" ) 
in (3) should denote a discourse referent, which is in 
fact a very natural assumption. In (3) and throughout 
the paper, recurring |T]'s indicate structure sharing, 
that is token-identity of information, as is common us- 
age in HPSG. Note also that we follow [Borsley(1987)] 
in representing subjects as values of SUBJ and follow 
[P&;S(1994)] (chapter 9) in representing non-subject 

^Head-driven Phrase Structure Grammar, which the reader 
is presumed to be more or less familiar with (see [P&S(1994)]). 

^In fact our analysis differs from many previous analyses of 
DCSDs in that we do not refer to any 'additional' (often pow- 
erful) mechanisms (sequence union, head wrapping, repeated 
rightward head movement). 

^The combination of HPSG with (shallow) event seman- 
tics and translation to an event-based logical form originates 
with work on the EC-sponsored PLUS project, Esprit P5254, 
a "Pragmatics-based Language Understanding System". 
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arguments as values of COMPS. 
(3) 



PHON 
HEAD 



voeren ) 
MAJOR V 

VFORM BASE 

SUBJ ( NP[case]:[2] ) 
COMPS ( NP[acc]:[3] ) 
GOV ( ) 

CONT [dET event 
PARA |T| 



RESTR 



RELN 


FEED 


INST 


s 


ARGl 




ARG2 





LEX + 

The constraint-based event semantics of the base form 
verb voeren as it is depicted in (3), with the quasi- 
determiner event, should be interpreted as an existen- 
tially quantified event with a parameter [T] which is 
restricted to involve a relation of feeding, an argument 
with the role of agent which is associated^ with a se- 
mantic content |T] an d an argument associated with a 
semantic conteiit[F| which is the theme.® 

Here the value of DET is a 'shallow' representation 
of a quantifier,^ and the value of PARA, which is an 
abbreviation for 'parameter', is structure shared with 
the value of a feature INST which is short for 'instance'. 
We will suppose that the value of PARA corresponds 
with a discourse referent in the discourse represen- 
tation associated with a natural language expression, 
without formally defining this relation here. The value 
of RESTR, which abbreviates 'restrictions', is a set of 
constraints on the value of this parameter. 

3 An Argument Composition Analysis 

We assume that the clause structure of DCSDs is one 
where we have a binary left-branching verbal complex. 
This verbal complex then locally selects the sum of the 
arguments of the verbs which constitute it. We feel 
that a binary branching analysis is empirically moti- 
vated by auxiliary flip in the same way as auxiliary 
flip motivates a binary right-branching structure for 
the German verbal complex, following [H&N(1989)]. 



*Here and throughout the paper, "<I>:v|>" means "feature 
structure <& with as cONTENT-value W\ 

'We assume that our constraint-based event semantics is in- 
ductively translated to a level of underspecified logical form, 
and that this UT.F-level then can be mapped to a level of logical 
form and a model-theoretic interpretation. The auxiliary levels 
are not defined here, but cf. [Rentier(ms.)]. 

^The concept of semantics we will outline here will be shallow 
for instance in the sense that we do not discuss quantification as 
it is common-place in formal semantics. However, cf. chapter 8 
of [P&;S(1994)] for discussion of a treatment of quantifier scope 
which could be combined with our approach, if so desired. 



A governing auxiliary will apply argument composi- 
tion and raise all the complements from the governed 
vcrb(s) to become arguments of the auxiliary, as pro- 
posed in [H&N(1989)]. We assume that causative and 
perceptual verbs syntactically behave just like auxil- 
iaries in this respect. 

The difference between auxiliaries on the one hand 
and perceptual and causative verbs on the other we 
view as basically semantic. We take it that auxiliaries 
semantically more or less operate on events, affecting 
features for tense and aspect or modahty. Causative 
and perceptual verbs on the other hand will be ana- 
lyzed as events themselves, events which take other 
events as their argument, in general as a theme (viz., 
a value of ARG2, cf. the entry in (7) below). 

In chapter 9 of [P&S(1994)] the approach to local se- 
lection from [Borsley(1987)] is developed further and 
leads to the Valence Principle, which refers to the va- 
lence features SUBJ and COMPS through 'F': 

(4) Valence Principle Chapter 9, [P&S(1994)] 
In a headed phrase, for each valence feature F, 
the F value of the head-daughter is the con- 
catenation of the phrase's F value with the list 
of SYNSEM values of the F-daughters' value. 

The general effect of the principle on a phrase which 
is headed by some sign is that this headed sign can 
only become 'complete' (or saturated''^) if it is com- 
bined with the appropriate arguments. For example, 
in the case of a transitive verb, such a verb must find 
a subject NP (selected through SUBJ) and some ob- 
ject (selected through COMPS). If we assume a flat 
clause structure analysis of Dutch and we furthermore 
assume lexical signs like (3) and (7), then the imme- 
diate dominance statements (5) and (6) will suffice to 
describe the construction of Dutch we are concerned 
with here.'^ Here the H,S and C indicate that the 
daughters of the phrase include a head, a subject and 
complements, not necessarily in that order (cf. chap- 
ter 9 of [P&S(1994)] for details). Note that in addition 
to the valency features SUBJ and COMPS, we also as- 
sume the presence of the GOV-feature, ranging over 1 
complement:* 

(5) XP[lEX-] S,Ci,.., C„,H[G0V( ), LEX-h] 

(6) X[lEX+] H[G0V ( Ci ), LEX-I-] , C, 

The second schema is in a sense not a "phrase" 
structure schema but is instead a "cluster- formation" - 
schema. This is because normally the combination of 
two or more words leads to a sign which is LEX-, a 
phrasal sign, but here it leads to a 'complex word' 
which is LEX+. Also (6) is strictly binary: it takes one 
argument, namely the argument which is the value of 



''Actually, our analysis also presupposes the Head Feature 
Principle and Semantics Principle from [P&S(1994)]; cf. Figures 
1 and 2 for informal illustration. 

^Following discussions of Webelhuth, Ackerman, Sag and 
Pollard at WCCFL XIII, suggesting this for German, and Chung 
of Ohio State University originally suggesting this for Korean. 
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GOV. We arrange the lexicon so that any value of GOV 
will always be an unsaturated base form verb which is 
defined as LEX+ as well. By the Valency Principle, 
this selection requirement of the governm^ verb will 
be appropriately 'cancelled' after string concatenation 
during parsing. 

Central to our analysis of the case-markings of 
NPs in the Dutch Mittelfeld is the assumption from 
[Pollard(fc.)] that base forms of verbs do not assign 
any case to their subject. The value for the subject- 
NP's CASE-feature in (3), "case", is the supertype in 
the type hierarchy for those atomic types that are ap- 
propriate values of the feature CASE. So, the value 
CASE is the supertype of nom and acc in Dutch and En- 
glish, and in German also of dat and gen. The result 
of assigning the subject-NP this supertype for case in 
practice boils down to giving this NP some kind of 
"any" -value for case; the case- value case of such an 
NP will unify with any other possible case value. 

In our analysis, the discontinuous relation between 
arguments and verbs in DCSDs is brought about firstly 
by lexically defining finite perceptuals hke zag (and 
finite causatives) as argument composition verbs, along 
the following lines:® 



(7) 



HEAD 



PHON ( zag ) 

MAJOR v 
VFORM FIN 

SUB J ( NP[nom]:[T] ) 
COMPS ( [5]nP[acc] ) © |T] 



GOV 



V[base] 

SUBJ ( UlNP ) 

COMPS [l] 
CONT [7] 
LEX -I- 



CONTENT 



DET EVENT 

PARA [e] 



RESTR < 





RELN ZIEN 




< 


INST [e] 


> 




ARGl \T\ 






ARG2 [7] 


> 



LEX 



The finite argument composition verb zag selects a 
singular nominative NP through its SUBJ-feature. As 
non-subject arguments it selects through its COMPS- 
featurc first the NP tagged as |T] which is unified with 
the SUB J- value of the governed verb(s), and secondly 
the list [T] of zero or more non-subject arguments 
of the governed verb(s). And crucially, being a gov- 
eining verb, zag selects through GOV a governed base 
form verb,^° with as SUBJ-value "[s]", as COMPS-value 



" [l] " and as semantics "[T|". Note that, since the gov- 
erned V[bse] is selected as missing a subject and a list 
of complements, it must not 'find' this subject or these 
complements, which it indeed doesn't (cf. the tree in 
Figure 1). 

As it were in passing, the governing perceptual verb 
(or causative verb alike) imposes accusative case on 
the NP which denotes the subject-argument of the 
governed verb. The unification of [CASE case] and 
[CASE acc] will be forced through the structure-sharing 
indicated in (7) as "[s]", and will result in the more 
specific restriction [CASE acc]. This accounts for the 
accusative case-marking on haar ("her") in examples 
(1) and (2), and in general on all non-subject argu- 
ments in such constructions. 

The second and crucial step in our account of the 
discontinuity is accounting for the linear order in the 
verb cluster with DCSDs. The linear order of the verb 
cluster in Dutch we account for through (8): 

(8) Linear Precedence Rule Dutch Verb Clusters 
[GOV { X )] < X 

(9) Linear Precedence Rule German Verb Clusters 

X < [GOV ( X )] 

By these LP-rules, in each part of the binary branch- 
ing verb cluster the governing verb will appear head- 
initial in Dutch, and head-final in German. It is 
straightforward to show that the above approach has 
the desired efi^ect also for the sentence (2) mentioned 
in the introduction if we define a lexical entry for the 
causative helpen with a syntax and semantics along the 
same lines as the perceptual zag. The only difference 
must be that such nonfinite entries do not assign nom 
to their subject, but "case". Other than that, there 
will just be additional embeddings in the semantics as 
well as in the verb cluster. Thus, by the ID-rule in (6) 
and the lexical entries for causatives and perceptuals, 
we account for the recursiveness of the phenomenon, 
cf. the tree in Figure 2. 

4 Conclusion 

We extended the [H&N(1989)]-analysis of German 
to Dutch, accounting for the difli'erence, resp. nested 
vs. cross serial dependencies, through one single LP- 
parameter. Also, we argued that such an argument 
composition approach is to be preferred over seA'eral 
alternative approaches, since argument composition is- 
n't an 'additional' mechanism. Further linguistic ad- 
vantages of this approach, i.e. accounts of irrcgiUar 
case assignments and constraints on double infinitives, 
are discussed in [Rentier(1994)]. We are able to derive 
verb second constructions by standard application of 



^In this entry and throughout the paper, © stands for con- 
catenation of arbitrary-length lists of arguments. 

^"One base form verb, or a base form verb-headed verbal clus- 



ter; due to the ID-schema in (6) either will be lex+, so that 
we are able to recursively build up bigger and bigger lex+- 
complexes. 

-'-'LP-rules like these are common in HPSG, cf. for instance 
the rule XP < SUB.J ( XP ) which orders subjects before VPs 
in English ([Borsley(1987)]). 
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Figure 1: The discontinuous relation: Valence Principle, schema's (5) & (6), entries (3) & (7), LP-rule (8). 

S 



NP[nom]:[T] NP[acc]:[2] NP[acc]:[3] 



ik 



haar 



de n. 



'v[fin] 

SUBJ ( NP[nom]:[T] ) 

COMPS ( NP[acc]:[2] , NP[acc]:[3] ) 

GOV ( ) 



DET EVENT 
PARA 







RESTR < 



RELN SEE 

INST [e] 

ARGl [T] 

DET EVENT 
PARA [t] 

RELN FEED 



ARG2 



RESTR < 



INST [y] 

AEGl [2] 
ARG2 [3] 



GOV ( I 8 I 



V[fin] 

SUBJ ( NP[nOM]:[T] ) 

COMPS ( [5]np[acc] ) e [l] 

V[dse] 

SUBJ ( |~5]nP ) 

COMPS ( [T] ) 
GOV ( ) 
CONT [T] 
LEX + 

DET EVENT 

PARA rel 



RESTR < 



> 





RELN SEE 






INST [e] 




< 


ARGl [T] 


> 




ARG2 [T| 


J 



V[bse] 

SUBJ ( NP:[2] ) 
COMPS ( NP[acc]:[3] ) 
GOV ( ) 



DET EVENT 
PARA I 







RESTR < 





RELN FEED 


> 




INST [T] 




< 


ARGl [2] 


> 




ARG2 [3] 





zag 



voeren 
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Figure 2: Recursion in the Verb Cluster (Sentence (2)). 

"v[fin] 

SUBJ ( [a]NP[NOM] ) 

COMPS ( [b]NP[Acc] , [c]NP[acc] , [d]NP[ACC] ) 
GOV ( ) 
LEX + 



V[fin] 

SUBJ ( [a]NP[NOM] ) 

COMPS ( [b]NP[Acc] ) e I LI I 
V[bse] 

SUDJ ( [b] ) 
[1] COMPS I LI I ) 
GOV ( ) 
LEX + 



zag 



[1] 



V[bse] 

SDB.I ( NP[CASe] ) 

COMPS ( [c]NP[acc] , [d]NP[ACC] ) 

GOV ( ) 

LEX + 




V[bse] 

SUBJ ( NP[cASE] ) 

COMPS ( [c]NP[acc] ) © I L2 I 
V[bse] 

SUBJ ( [c] ) 
[2] COMPS I L2 I ) 
GOV ( ) 
LEX + 



[2] 



V[bse] 

SUBJ ( NP[cASE] ) 
COMPS ( [d]NP[ACC] ) 
GOV ( ) 
LEX + 



helpen 



voeren 



the Dutch versions of the extraction lexical rules (see 
[Rentier(1993)]) to the verbs at the lexical level. 
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